Skip to main content

Fix Broken Links Web Crawls

These crawls are part of an effort to archive pages as they are created and archive the pages that they refer to. That way, as the pages that are referenced are changed or taken from the web, a link to the version that was live when the page was written will be preserved.

Then the Internet Archive hopes that references to these archived pages will be put in place of a link that would be otherwise be broken, or a companion link to allow people to see what was originally intended by a page's authors.

The goal is to fix all broken links on the web. Crawls of supported "No More 404" sites.

85,477
RESULTS
rss


Media Type
4
collections
85,456
web
17
data
Year
13,960
2018
20,891
2017
26,524
2016
10,741
2015
8,544
2014
4,796
2013
More right-solid
Topics & Subjects
85,456
crawldata
41,061
no404
22,182
wordpress
18,127
wikipedia
752
search
1
GDELT
More right-solid
Collection
More right-solid
Creator
85,456
internet archive
SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Nov 9 23:09:50 PST 2018 to Sun Nov 11 10:38:26 PST 2018.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl108.us.archive.org:no404 from Sun Nov 11 05:52:41 PST 2018 to Sun Nov 11 08:16:01 PST 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 10:16:39 PST 2018 to Sun Nov 11 04:51:32 PST 2018.
Topic: crawldata
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 13:49:25 PST 2018 to Sun Nov 11 07:31:23 PST 2018.
Topic: crawldata
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 18:59:53 PST 2018 to Sun Nov 11 13:35:15 PST 2018.
Topic: crawldata
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Nov 9 22:23:28 PST 2018 to Sun Nov 11 09:51:03 PST 2018.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl105.us.archive.org:no404 from Sun Nov 11 10:58:52 PST 2018 to Sun Nov 11 12:09:25 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
data
eye 1
favorite 0
comment 0
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 16:26:37 PST 2018 to Sun Nov 11 10:52:21 PST 2018.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
data
eye 1
favorite 0
comment 0
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 17:38:56 PST 2018 to Sun Nov 11 12:06:34 PST 2018.
Topic: crawldata
Wikipedia Near Real Time (from IRC)
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl109.us.archive.org:no404 from Fri Nov 9 02:50:20 PST 2018 to Sat Nov 10 08:02:31 PST 2018.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
data
eye 1
favorite 0
comment 0
Wikipedia Near Real Time (from IRC)
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl896.us.archive.org:no404 from Sat Nov 10 10:59:52 PST 2018 to Sun Nov 11 09:25:27 PST 2018.
Topics: no404, wikipedia, crawldata
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 12:43:34 PST 2018 to Sun Nov 11 07:04:02 PST 2018.
Topic: crawldata
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sun Nov 11 18:22:22 PST 2018 to Sun Nov 11 11:30:11 PST 2018.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Sun Nov 11 08:42:55 PST 2018 to Sun Nov 11 08:33:37 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Sun Nov 11 01:11:45 PST 2018 to Sun Nov 11 03:48:30 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl105.us.archive.org:no404 from Sun Nov 11 04:38:50 PST 2018 to Sun Nov 11 06:22:34 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl106.us.archive.org:no404 from Sun Nov 11 04:28:59 PST 2018 to Sun Nov 11 06:26:21 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl106.us.archive.org:no404 from Sun Nov 11 11:07:07 PST 2018 to Sun Nov 11 12:07:53 PST 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 15:12:29 PST 2018 to Sun Nov 11 09:34:51 PST 2018.
Topic: crawldata
GDELT
web
eye 1
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 08:51:01 PST 2018 to Sun Nov 11 03:28:17 PST 2018.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Sat Sep 15 13:34:19 PDT 2018 to Fri Sep 21 19:58:11 PDT 2018.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl105.us.archive.org:no404 from Sat Sep 22 19:50:10 PDT 2018 to Sun Sep 30 00:37:18 PDT 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 05:54:55 PST 2018 to Sun Nov 11 02:10:54 PST 2018.
Topic: crawldata
Wikipedia Near Real Time (from IRC)
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl110.us.archive.org:no404 from Thu Nov 8 22:46:56 PST 2018 to Sat Nov 10 18:48:31 PST 2018.
Topics: no404, wikipedia, crawldata
GDELT
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 04:09:44 PST 2018 to Sat Nov 10 23:16:51 PST 2018.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl108.us.archive.org:no404 from Mon Sep 17 01:51:12 PDT 2018 to Sun Sep 23 19:59:29 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl105.us.archive.org:no404 from Sat Nov 10 21:44:33 PST 2018 to Sat Nov 10 23:51:43 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl897.us.archive.org:no404 from Thu Sep 13 12:06:17 PDT 2018 to Thu Sep 13 05:35:26 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl108.us.archive.org:no404 from Sat Nov 10 15:38:41 PST 2018 to Sat Nov 10 18:43:40 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl108.us.archive.org:no404 from Sat Nov 10 23:51:11 PST 2018 to Sun Nov 11 01:26:53 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl106.us.archive.org:no404 from Sat Nov 10 22:09:30 PST 2018 to Sat Nov 10 23:39:13 PST 2018.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Wed Nov 7 13:37:23 PST 2018 to Wed Nov 7 10:19:49 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Sat Nov 10 18:18:50 PST 2018 to Sat Nov 10 21:04:11 PST 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Nov 11 00:44:41 PST 2018 to Sat Nov 10 21:45:11 PST 2018.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 2
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Sun Oct 7 09:08:27 PDT 2018 to Wed Oct 10 22:04:06 PDT 2018.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl105.us.archive.org:no404 from Thu Nov 8 18:30:32 PST 2018 to Thu Nov 8 22:28:18 PST 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
data
eye 3
favorite 0
comment 0
Wordpress Blogs and the Pages They Link To
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl105.us.archive.org:no404 from Wed Sep 19 04:21:40 PDT 2018 to Mon Sep 24 17:36:58 PDT 2018.
Topics: no404, wordpress, crawldata
Wikipedia Near Real Time (from IRC)
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl898.us.archive.org:no404 from Fri Nov 9 18:55:49 PST 2018 to Sat Nov 10 16:01:12 PST 2018.
Topics: no404, wikipedia, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl897.us.archive.org:no404 from Thu Sep 13 12:35:21 PDT 2018 to Thu Sep 13 06:26:38 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 3
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl108.us.archive.org:no404 from Fri Nov 9 15:39:59 PST 2018 to Fri Nov 9 18:21:47 PST 2018.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Mon Nov 5 15:10:25 PST 2018 to Mon Nov 5 15:20:49 PST 2018.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl895.us.archive.org:no404 from Sun Oct 7 03:41:11 PDT 2018 to Sat Oct 6 21:20:17 PDT 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 4
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Fri Nov 17 14:34:55 PST 2017 to Fri Nov 17 08:58:52 PST 2017.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 4
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Thu Sep 6 21:51:13 PDT 2018 to Thu Sep 13 11:17:46 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 5
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl108.us.archive.org:no404 from Sun Sep 9 11:21:13 PDT 2018 to Sat Sep 15 00:50:48 PDT 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 5
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 03:33:34 PDT 2017 to Thu Jun 8 20:55:00 PDT 2017.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 5
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Wed Sep 19 13:46:39 PDT 2018 to Wed Sep 26 13:15:49 PDT 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 5
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 04:05:05 PDT 2017 to Thu Jun 8 21:37:14 PDT 2017.
Topic: crawldata
GDELT
web
eye 5
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 04:32:40 PDT 2017 to Thu Jun 8 21:58:43 PDT 2017.
Topic: crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Mon Sep 24 11:07:41 PDT 2018 to Tue Oct 2 08:07:14 PDT 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 13:17:54 PDT 2017 to Fri Jun 9 06:37:09 PDT 2017.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl106.us.archive.org:no404 from Mon Sep 17 20:42:36 PDT 2018 to Mon Sep 24 23:16:30 PDT 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 03:01:50 PDT 2017 to Thu Jun 8 20:22:59 PDT 2017.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl106.us.archive.org:no404 from Sun Sep 9 23:06:58 PDT 2018 to Sat Sep 15 11:27:18 PDT 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 03:11:23 PDT 2017 to Thu Jun 8 20:46:20 PDT 2017.
Topic: crawldata
GDELT
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 04:21:33 PDT 2017 to Thu Jun 8 21:43:17 PDT 2017.
Topic: crawldata
GDELT
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Jun 7 13:57:26 PDT 2017 to Wed Jun 7 08:24:16 PDT 2017.
Topic: crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl106.us.archive.org:no404 from Thu Oct 4 10:18:44 PDT 2018 to Sun Oct 7 06:57:03 PDT 2018.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl107.us.archive.org:no404 from Wed Sep 12 02:21:12 PDT 2018 to Mon Sep 17 11:33:03 PDT 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 6
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 03:49:53 PDT 2017 to Thu Jun 8 22:05:41 PDT 2017.
Topic: crawldata
GDELT
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Jun 7 18:03:27 PDT 2017 to Wed Jun 7 11:22:42 PDT 2017.
Topic: crawldata
GDELT
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Jun 7 16:12:48 PDT 2017 to Wed Jun 7 09:38:27 PDT 2017.
Topic: crawldata
GDELT
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 03:18:56 PDT 2017 to Thu Jun 8 20:47:22 PDT 2017.
Topic: crawldata
GDELT
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 02:41:07 PDT 2017 to Thu Jun 8 19:58:08 PDT 2017.
Topic: crawldata
GDELT
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Jun 7 16:27:14 PDT 2017 to Wed Jun 7 09:48:36 PDT 2017.
Topic: crawldata
GDELT
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Jun 7 16:55:03 PDT 2017 to Wed Jun 7 10:09:47 PDT 2017.
Topic: crawldata
Wordpress Blogs and the Pages They Link To
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl895.us.archive.org:no404 from Sat Nov 3 03:59:52 PDT 2018 to Fri Nov 9 22:45:16 PST 2018.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl105.us.archive.org:no404 from Mon Nov 5 17:52:00 PST 2018 to Mon Nov 5 17:09:04 PST 2018.
Topics: no404, wordpress, crawldata
GDELT
web
eye 7
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jun 9 13:25:12 PDT 2017 to Fri Jun 9 06:52:16 PDT 2017.
Topic: crawldata
GDELT
web
eye 8
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Jun 7 17:03:25 PDT 2017 to Wed Jun 7 10:22:14 PDT 2017.
Topic: crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl105.us.archive.org:no404 from Sun Nov 4 14:18:12 PST 2018 to Sun Nov 4 14:14:55 PST 2018.
Topics: no404, wordpress, crawldata